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SUBSTITUTED NUCLEIC ACID MIMICS 



FIELD OF THE INVENTION 

This invention is directed to the synthesis and 
use of nucleic acid mimics containing one or more 
5 heterocyclic base moieties substituted by chemical groups in 
order to diminish or prevent the formation of triplexes . 
This effect can be used to design antisense or probe 
reagents that avoid forming triplexes • 

BACKGROUND OF THE INVENTION 

10 In the art, there are known several nucleic acid 

mimics having nucleobases bound to backbones other than the 
naturally occurring ribonucleic acid or deoxyribonucleic 
acid backbones having the ability to bind to nucleic acids 
having a nucleobase sequence complementary to the base 

15 sequence of the nucleic acid mimic. Among these , only the 
peptide nucleic acids (PNA's) as described, for example, in 
WO 92/20702 have demonstrated a likelihood for potential use 
as therapeutic and diagnostic reagents. This may be due to 
their ability to bind nucleic acids (NAs) of complementary 

20 nucleobase sequence with a higher affinity than shown by the 
corresponding wild- type nucleic acid. 
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One of the unique properties of PNAs is their 
ability to form PNA 2 -NA triplexes that are more stable than 
the corresponding PNA-NA duplexes. This ability can be used 
advantageously for various purposes including PCR clamping 
5 (WO 93/25706) . However, there are some drawbacks for 

applications that require sequence selection, because such 
selection would be biased for triplex forming sequences. 
Therefore, there is a need for PNAs that do not form such 
triplexes. 



10 OBJECTS OF THE INVENTION 

It is an object of this invention to provide 
substituted nucleic acid mimics that do not preferentially 
form triplexes with nucleic acids. 

It is a further object of this invention to 
15 provide methods for sequence selective determination of 
nucleic acids. 

It is yet a further object of this invention to 
provide therapeutic, diagnostic and research reagents that 
can modulate the expression of nucleic acids which encode 
20 proteins suspected of causing or indicating the existence of 
a disease state. 



BRIEF DESCRIPTION OF THE INVENTION 

In accordance with this invention there are 
provided nucleic acid mimics containing one or more 

25 heterocyclic bases substituted by a sterically bulky 

substituent at a position which is 1, 2 or 3 atoms removed 
from the atom of the base which is attached to the backbone. 

Further there are provided methods for 
disfavouring the formation of triplex structures comprising 

3 0 a nucleic acid strand and two strands of a nucleic acid 

mimic, having a base sequence complementary to the nucleic 
acid strand. Such methods include incubating a mixture of 
the nucleic acid and the nucleic acid mimic under conditions 
suitable for forming a nucleic acid/nucleic acid mimic 

35 duplex. The formation of triplexes is avoided by providing 
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sterically bulky substituents on the nucleic acid mimic 
which are located at positions that would be in close 
proximity to each other if bound to nucleic acid in a 
triplex. 

5 In accordance with this invention there are 

provided methods for the determination of a nucleic acid by 
providing a nucleic acid mimic substituted at positions 
which are 1, 2 or 3 atoms removed from the atom of the base 
which is attached to the backbone. Said nucleic acid mimic 

10 is incubated with the nucleic acid under conditions suitable 
for the formation of a duplex between the nucleic acid mimic 
and the nucleic acid. The occurrence of the duplex is 
related to the identity or existence of the nucleic acid. 

The present invention provides nucleic acid mimics 

15 for modulating the expression of nucleic acids that encode 
proteins which are suspected of producing a disease state in 
mammals . The nucleic acid mimics of this invention can be 
used in therapeutics, diagnostics and as research reagents. 

One favourable aspect of this invention is that 

20 nucleic acid mimics substituted as described herein 

substantially retain the ability to form duplexes with good 
efficiency and discrimination comparable to the 
corresponding unsubstituted nucleic acid mimic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 Figure 1 is a schematic illustrating an exemplary 

synthesis of a PNA monomer containing cytosine substituted 
at the N 4 position. 

Figure 2 is a schematic illustrating the Watson- 
Crick base pairing between N 4 substituted cytosine of a PNA 

30 and guanosine of a DNA. 

DETAIIiED DESCRIPTION OF THE INVENTION 

In accordance with this invention, novel compounds 
are provided that are useful for disfavouring the formation 
of triplexes with nucleic acids. A nucleic acid mimic in 
35 accordance with the invention is a molecule having a 
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sequence of modified heterocyclic bases, preferably 
naturally occurring bases, e.g. those which occur in "wild- 
type" nucleic acids, bound to a non-naturally occurring 
backbone. The nucleic acid mimics bind to a nucleic acid 
5 having a complementary base sequence through base pairing. 

Preferred nucleic acid mimics are molecules 
wherein the base moieties are bound to the backbone via an 
amine nitrogen atom of the backbone. Preferred backbone 
structures for the mimics are described in WO 92/20702, 
10 United States Patent application Serial No. 08/054,363, 
filed April 26, 1993, United States Patent application 
Serial No. 08/319,411, filed October 6, 1994 and United 
States Patent application Serial No. 08/366,231, filed 
December 28, 1994. The above -referenced disclosures are 
15 herein incorporated by reference. 

Heterocyclic bases of the nucleic acid mimics of 
the present invention are heterocyclic moieties that are 
able to base pair with nucleobases of a nucleic acid by 
hydrogen bonding. In the case of triplex formation, two 
20 kinds of interactions are involved; Watson-Crick binding and 
Hoogsteen binding. The formation of triplexes between PNA 
and NA is described in WO 95/01370. 

The term "heterocyclic moiety" or "heterocyclic 
base" includes the naturally occurring purine and pyrimidine 
25 nucleobases. For the purpose of this invention, the term 
"pyrimidine" refers to any 1,3-diazine irrespective of its 
substituents . The naturally occurring pyrimidine 
nucleobases are cytosine, thymine and uracil. Naturally 
occurring purine nucleobases include adenine and guanine. 
30 The term "heterocyclic moiety" or "heterocyclic base" also 
includes non-naturally occurring nucleobases. An example of 
a non-naturally occurring base is a base in which any of the 
ring atoms of the nucleobases is replaced by another atom. 
For example, CH may be replaced by N and vice versa. Such 
35 modifications can occur at more than one position. Another 
example of a non-naturally occurring base is a base in which 
the 2- and 4 -substituents of a naturally occurring base are 
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reversed. Structures of naturally and non-naturally 
occurring pyrimidine bases are shown below (the third 
structure from the left is that of a non-naturally occurring 
pyrimidine base known as pseudo-isocytosine) : 



NHj 
4 



N 




O 




N 



5 
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HN 



KJ 




NH 2 



NT 




In the invention, the heterocyclic moiety is 
attached to the backbone at a specific ring position of the 
heterocycle. In the case of substituted naturally occurring 
nucleobases, this position is preferably occupied by a 
10 nitrogen atom. According to this invention, the sterically 
bulky substituent can be attached to the heterocyclic moiety 
at a position which is 1, 2 or 3 atoms removed from the 
position of attachment of the heterocyclic moiety to the 
backbone. In case of the pyrimidine bases, positions 
15 conventionally numbered as ring position 4, 5 and 6 are 

preferred. The 4 -position is most preferred for attaching a 
bulky substituent. Some effect on triplex formation may 
also occur when the substituent is attached to the 5- and 6- 
positions, but in this case, the substituents should be 
20 sterically bulkier than substituents located at position 4. 
In the case of non-naturally occurring bases, positions 
corresponding to pyrimidine positions 4, 5 and 6 in their 
spatial orientation are also preferred. In case of 
substitution on the 5 -position of a non-naturally occurring 
25 base, the triplex formation is pH dependent as it is for a 
naturally occurring base such as cytosine. Duplex formation 
is likely not effected by pH in any case. 
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Shown above are formulae of heterocyclic bases having 
substituents designated R. Each R can independently be H, 
-NO,-N0 2 , -S0 3 , -CN, -OH, -SH, -P0 3 2 \ -COOH, -R' # -F, -Cl, 
5 -Br, -I, -O-R', -S-R' , ~N(R') 2 , -C(R') 3 , -C(=X)(R'), 

C{=X) (-Y-R' ) f S(=Z) X _ 2 (-Y-R') , in which Z is O, X is O, S or 
NH, and Y is O, S or NH, wherein at least one R is a 
sterically bulky group. Preferred bulky groups contain 3 
non-hydrogen atoms or more, most preferred bulky groups 

10 contain 6 non-hydrogen atoms or more and are preferably 
cyclic and/or aromatic. It will be apparent from the 
description of this invention that these preferred 
definitions apply to the case wherein at least one R 
substituent is different from hydrogen. In case 2 or more R 

15 groups are bulky, the spatial requirements for achieving 
inhibition may be reduced, for example, from 6 atoms to 3 
atoms . 
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It is preferred that R groups are acyl groups, 
especially aromatic acyl groups. It is especially preferred 
that the acyl groups be bound to a nitrogen atom at position 
4 of a pyrimidine base. An especially preferred acyl group 
5 is the benzoyl group. 

R' is preferably selected from H; alkyl, alkenyl 
or alkynyl (each having from 1-50 C atoms) ; aryl, naphthyl, 
biphenyl or tolyl (each having from 6-50 C atoms) . These 
groups may be straight or branched chain, symmetric or 

10 asymmetric, chiral or achiral, and may contain one or more 
heteroatoms selected from N, NH, S and O, and may also 
comprise fused aromatic systems. R' may be heterocyclic, 
including pyridyl, imidazolyl, pyrimidinyl, pyridazinyl, 
guinolyl, acridinyl, imidazolyl, pyrrolyl, furanyl, thienyl, 

15 isoxazolyl, oxazolyl, thiazolyl or biotinyl and may be bound 
or fused to any available position. 

R' may be substituted, preferentially with one or 
more lower organic groups (up to 10 carbon-atoms) or 
derivatives thereof which enhance the triplex inhibiting 

20 effect or are otherwise useful herein. These may be groups 
such as alkyl, alkenyl, alkynyl, aryl, naphthyl, biphenyl, 
tolyl, benzyl, and groups such as -N0,-N0 2 , -S0 3 , -CN, -OH, 
-SH, -P0 3 2 \ -COOH, -P, -CI, -Br, and -I . 

Compounds of the present invention can be 

25 conveniently prepared according to the methods described in 
WO 92/20702. An especially preferred method of synthesis 
uses, in a first step, the synthesis of the base substituted 
by the sterically bulky substituent, preferably having also 
attached a reactive group and/or a linker moiety for 

3 0 attachment of the modified base to a monomeric backbone 
unit, for example, protected N-aminoethylglycine . In a 
second step, bases are attached via the linker moiety to a 
nitrogen atom at the preformed and protected monomeric 
backbone unit. In a third step, the base -containing monomer 

35 is prepared for oligomerization with other bases containing 
monomeric backbone units or an already formed oligomer, e.g. 
cleaving of protecting groups at one end of the backbone 
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unit and/or activating this end for oligomerization. In a 
fourth step, the base -containing monomers are oligomerized 
depending upon the sequence requirements for complementarity 
for duplex formation with a complementary nucleic acid. 

Preferred monomeric backbone units that may be 
protected with a protecting group appropriate for the active 
groups during synthesis of the monomeric backbone unit are 
compounds of the general formula: 

R3 
■I 

R1 /N "R2 

wherein : 

R 1 is Ci-C 4 alkyl substituted by - COOP 1 , -NHP 1 , -OP 1 
or SP 1 , wherein P 1 is hydrogen or a protecting group; 

R 2 is Ci-C 4 alkyl substituted by -COOP 2 , -NHP 2 , -OP 2 
or SP 2 , wherein P 2 is hydrogen or a protecting group; 

M is a naturally or non-naturally occurring 
heterocyclic moiety bound by a linker to nitrogen, said 
linker being 1-3 atoms in length; and 

R 3 is a sterically bulky substituent containing at 
least 3 or more non- hydrogen atoms. 

Monomers which are not substituted by R 3 are 
disclosed in WO 92/20702. In a preferred case, R 1 contains 
the group -COOP 1 and R 2 contains the group -NHP 2 , wherein the 
protecting groups (P 1 and P 2 ) are cleavable under different 
reaction conditions from each other. 

For example, in certain preferred embodiments, 
peptide nucleic acid backbones may be employed. Such 
backbones have the general formula (I) : 
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L 1 L 2 L n 

I< I' }- 

i 1 r 1 1 2 r 2 R n | 

C 1 "D 1 ^C 2 ^D 2 ~C n D n 

(I) 



wherein: 



n is at least 2, 
5 each of h x -h n is independently selected from the 

group consisting of hydrogen, hydroxy, (C x -C 4 ) alkanoyl , 
naturally occurring nucleobases, non -naturally occurring 
nucleobases, aromatic moieties, DNA intercalators , 
nucleobase-binding groups, heterocyclic moieties, and 

10 reporter ligands, at least one of L 1 -^ being a naturally- or 
non-natur ally- occurring nucleobase substituted with a 
sterically bulky group as described herein; 

each of C^-C 1 is (CR 6 R 7 ) y where R 6 is hydrogen and R 7 
is selected from the group consisting of the side chains of 

15 naturally occurring alpha amino acids , or R 6 and R 7 are 
independently selected from the group consisting of 
hydrogen, (C 2 -C 6 ) alkyl, aryl, aralkyl, heteroaryl, hydroxy, 
(C 1 -C 6 )alkoxy f (CVC^) alkylthio, NR 3 R 4 and SR 5 , where R 3 and R 4 
are as defined above, and R 5 is hydrogen, (C^-Cg) alkyl, 

20 hydroxy-, alkoxy- , or alkylthio- substituted (C x -C 6 ) alkyl , or 
R € and R 7 taken together complete an alicyclic or 
heterocyclic system; 

each of D 1 -D n is (CR 6 R 7 ) Z where R 6 and R 7 are as 
defined above; 

25 each of y and z is zero or an integer from 1 to 

10, the sum y + z being greater than 2 but not more than 10; 

each of G 1 -G n_1 is ~NR 3 CO- , -NR 3 CS- , -NR 3 S0- or - 
NR 3 S0 2 -, in either orientation, where R 3 is as defined above; 

each pair of A 1 -A n and B 1 -B n are selected such that : 
30 (a) A is a group of formula (Ila) , (lib) or (lie) 

and B is N or R 3 N + ; or 

(b) A is a group of formula (lid) and B is CH; 
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R 2 Jq 



(Ha) 



(lib) 



10 



R 2 r 



R 2 s 



I I! 

-H C- 



R 2 s 



X 

II 
-C M 



(lie) did) 

where : 

X is O, S, Se, NR 3 , CH 2 or C(CH 3 ) 2 ; 
Y is a single bond, O, S or NR 4 ; 

each of p and q is zero or an integer from 1 to 5, 
the sum p+q being not more than 10; 

each of r and s is zero or an integer from 1 to 5, 

the sum r+s being not more than 10; 

each R 1 and R 2 is independently selected from the 
group consisting of hydrogen, ( Cl -C 4 ) alkyl which may be 
hydroxy- or alkoxy- or alkyl thio- substituted, hydroxy, 
15 alkoxy, alkylthio, amino and halogen ; 

each of G 1 -G n " 1 is -NR 3 CO- , -NR 3 CS-, -NR 3 SO- or - 
NR 3 S0 2 -, in either orientation, where R 3 is as defined above; 

Q is -C0 2 H, -CONR'R", -S0 3 H or -S0 2 NR'R" or an 
activated derivative of -C0 2 H or -S0 3 H; and 

I is -NHR"'R"" or -NR" ' C (O) R" " , where R' , 
r« , R' " and R" " are independently selected from the group 
consisting of hydrogen, alkyl, amino protecting groups, 
reporter ligands, intercalators , chelators, peptides, 
proteins, carbohydrates, lipids, .steroids, oligonucleotides 
25 and soluble and non-soluble polymers. 

in certain embodiments, at least one A is a group 
of formula (He) and B is N or R 3 N*. In other embodiments, A 



20 
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is a group of formula (Ila) or (lib) , B is N or R 3 N + , and at 
least one of y or z is not 1 or 2 . 

Some preferred peptide nucleic acids have general 
formula (Ilia) or (Illb) ; 




(Illb) 

wherein: 

each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic base moieties, 
including those substituted with a sterically bulky group or 
groups, naturally occurring nucleobases, and non-naturally 
occurring nucleobases; 

each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 

n is an integer from 1 to 60; 
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each of k, 1, and m is independently zero or an 
integer from 1 to 5; 

p is zero or 1; 

R h is OH, NH 2 or -NHLysNH 2 ; and 
5 R 1 is H or COCH 3 . 

Particularly preferred are compounds having 
formula (Ilia) or (Illb) wherein each L is independently 
selected from the group consisting of the nucleobases 
thymine (T) , adenine (A) , cytosine <C) , guanine (G) and 
10 uracil (U) , especially where one or more are modified with a 
sterically bulky substituent in accordance with this 
invention, k and m are zero or 1, and n is an integer from 1 
to 30, in particular from 4 to 20. 

The peptide nucleic acids of the invention can be 
15 synthesized by adaptation of standard peptide synthesis 
procedures, either in solution or on a solid phase. The 
synthons used are specially monomer amino acids or their 
activated derivatives, protected by standard protecting 
groups. The oligonucleotide analogs also can be synthesized 
20 by using the corresponding diacids and diamines. 

Thus, monomer synthons useful for incorporation 
into PNA of the preceding formulae include those selected 
from the group consisting of amino acids, diacids and 
diamines , having general formulae : 

25 L L \ 



A 



A * 



K C /K D / F or K C ^S/ E or F ^C^D /F 

(IV) (V) (VI) 

wherein L, A, B, C and D are as defined above, except that 
any amino groups therein may be protected by amino protec- 
3 0 ting groups; E is COOH, CSOH, SOOH, S0 2 OH or an activated 
derivative thereof; and F is NHR 3 or NPgR 3 , where R 3 is as 
defined above and Pg is an amino protecting group. 
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Preferred monomer synthons according to the 
invention include those having formula (Villa) - (VIIIc) : 



L 




(Villa) 




(VHIb) 
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(VIIIc) 

or amino-protected and/or acid terminal activated der- 
ivatives thereof, wherein L is selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
naturally occurring nucleobases, and non-naturally occurring 
nucleobases; and R 7 ' is selected from the group consisting of 
hydrogen and the side chains of naturally occurring alpha 
amino acids. 

Also useful in the present invention are chiral 
PNA backbones. Such backbones are preferably derived from 
two or more monomers, at least one of which contain a 
aliphatic cyclic structure. Representative of such monomers 
are those of formula: 



B 




wherein : 
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B is a naturally or non- naturally occurring 
nucleobase which may be substituted with a sterically bulky 
group in accordance with this invention; 

at least one of Co: or CjS is in the S 
5 configuration; 
and 

n is 0, 1, 2 , or 3. 

In preferred embodiments C<x and C/3 are in the S 
configuration. In further preferred embodiments of the 

10 invention B is adenine, cytosine, guanine, thymine, or 
uracil. In more preferred embodiments n is 2 . 

In further preferred embodiments the peptide 
nucleic acid oligomers contain at least one peptide nucleic 
acid monomer having a ( 2 -aminoethyl) glycine backbone with a 

15 chiral center in the ethyl portion of the backbone. The 

monomer is incorporated into peptide nucleic acid oligomers 
at a position corresponding to a region of variability in 
the target molecule. 

One nucleic acid mimic can contain one or more 

20 nucleobases modified as described above. It was found that 
increasing the number of nucleobases containing sterically 
bulky substituents within one nucleic acid mimic inhibited 
triplex formation while retaining the ability to form 
duplexes . 

25 In order to achieve the inhibition of triplex 

formation the nucleic acid mimic and the position of 
attachment of the sterically bulky group are chosen such 
that the heterocyclic bases to which the sterically bulky 
substituent is attached would be located in close proximity 

30 to each other when bound to the nucleic acid, were a triplex 
to form. Preferably the substituted bases on the nucleic 
acid mimics should, in the hypothetical triplex, be located 
on the same side, i.e. base pairing to the same nucleobase 
of the nucleic acid strand. This case wherein the 

35 substituted bases of the mimic would base pair to the same 
base on the nucleic acid strand will be termed as "opposed" . 
That the substituted bases would have to base pair with a 
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predefined base on the nucleic acid strand can be achieved 
by choosing the base sequence and orientation of the mimics 
such that only the triplex formation could occur in a way 
which is inhibited by the use of the sterically bulky 
5 substituents . 

The compounds of the present invention can be used 
in methods for the determination of a nucleic acid 
comprising a nucleic acid mimic substituted at positions 
which are 1, 2 or 3 atoms removed from the atom of the base 

10 which is attached to the backbone, incubating said nucleic 
acid mimics and said nucleic acid under conditions suitable 
for the formation of a duplex between said nucleic acid 
mimic and said nucleic acid and determining the occurrence 
of said duplex as a measure of the occurrence of said 

15 nucleic acid. These methods are believed to function 

according to the principles described in WO 92/20703 (herein 
incorporated by reference) by replacing the compounds used 
in the prior art with the compounds described herein above . 
It is especially preferred to use a nucleic acid mimic which 

20 is labeled with a reporter group either at one of the 

termini of the nucleic acid mimic or at any position of the 
backbone or the base moieties . A reporter group according 
is a group that can be detected, for example a fluorescent 
group like fluorescein, or one which can be detected by a 

25 further compound which is bound in a subsequent step to the 
reporter group. For example, if the sterically bulky 
substituent is a biotin group or a group containing a biotin 
group, the nucleic acid mimic, and thereby the nucleic acid 
can be determined by adding detectable streptavidin to the 

30 hybrid. It is preferred to remove any excess biotin- labeled 
nucleic acid mimic from the mixture prior to this 
incubation. The reporter group is then detected by means 
which are known to the art -skilled. 

The present invention is suitable for detection of 

35 expression of a disease -causing protein in a cell or tissue 
sample from patients who have a disease state. A number of 
assays may be formulated for the inhibition of protein 
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expression employing the present invention, which assays 
will commonly comprise contacting a cell or tissue sample 
with a nucleic acid mimic of the invention under conditions 
selected to permit detection, and usually quantitation, of 
5 such inhibition. As described below, fluorescein- labeled 
nucleic acid mimics are prepared and contacted with a cell 
or tissue sample suspected of expression of a disease- 
causing protein. The sample is then washed to remove 
unbound nucleic acid mimic. Fluorescence remaining in the 
10 sample, detected and guantitated by fluorimetry, indicates 
bound nucleic acid mimic (which in turn indicates the 
presence of nucleic acid encoding the disease-causing 
protein) . 

The compounds of the present invention may be 
15 useful in binding to target molecules. Target molecules of 
the present invention can include any of a variety of 
biologically significant molecules. Such target molecules 
may be nucleic acid strands such as significant regions of 
DNA or RNA which encode proteins that may be responsible for 
2 0 causing and/or maintaining a disease state in mammals. Such 
other target molecules may be transcription factors. Target 
molecules can be carbohydrates, glycoproteins or other 
proteins. In some preferred embodiments, the target 
molecule can be a protein such as an immunoglobulin, 
25 receptor, receptor binding ligand, antigen or enzyme, and 
more specifically can be a phospholipase, tumor necrosis 
factor, endotoxin, interleukin, plasminogen activator, 
protein kinase, cell adhesion molecule, lipoxygenase, 
hydrolase or transacylase . In other embodiments of the 
30 invention, the target molecule may be an important region of 
the human immunodeficiency virus, Candida, herpes viruses, 
papillomaviruses, cytomegalovirus, rhinoviruses, hepatitis 
viruses or influenza viruses. In yet other embodiments of 
the invention, the target molecule may be a region of an 
35 oncogene. 

The following examples further illustrate the 
invention and are not intended to limit the same. 
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EXAMPLES 
EXAMPLE 1 

A. Exemplary General Syntheses 

Phosphoramidates were purchased from Cruachem (UK) 
5 and the DNA oligomers were assembled on a MilliGen/Biosearch 
8700 DNA synthesizer. The A, C, G and T PNA monomers were 
purchased from Biosearch (USA) . N' -Boc-aminoethyl glycine 
was purchased from Biosearch (USA) . All PNA oligomers were 
synthesized on a custom-made PNA synthesizer (Biosearch, 

10 USA) by a modified Merrif ield method (Christensen, L. , 
Fitzpatrick, R. , Gildea, B., Warren, B- and Coull, J, 
(1994) , Innovations and Perspectives in Solid Phase 
Synthesis, R. Epton, Ed., SPCC (UK) Ltd., Oxford, England; 
Christensen et al., (1995), J. Pep. Sci., 3, 175) and 

15 purified by reverse phase-HPLC. The PNA oligomers were 
characterized by FAB + MS. 

B. T m Measurements 

Absorbance versus temperature was measured at 
260 nm using a Guilford Response spectrophotometer. Heating 

20 rate was 0.5°C/min from 5-90°C. PNA oligomers were 

hybridized with complementary DNA sequences in a medium salt 
buffer containing 100 mM NaCl, 10 mM sodium phosphate and 
0.1 mM EDTA, pH was adjusted to 5, 7 or 9, as desired. The 
samples were heated to 90 °C for 5 min, slowly cooled to 20° 

25 and left at 4°C for 30 min prior to T m measurements. 

C. Synthesis of modified cytosine monomer 

(i) Benzoyl cytosin-l-ylacetate (1) 
Reference is made to Figure 1 where to cytosine 
(20 g, 0.18 mol) in 400 mL DMF was added 7.2 g (0.18 mmol) 

3 0 of NaH (disp. in oil 60%) . The mixture was heated to 50 °C 
and stirred for 2 h under nitrogen. After cooling to room 
temperature, 29 mL (1.1 eq.) of benzyl bromoacetate was 
added over 2 h. After stirring overnight, the dark 
suspension was filtered and the filtrate washed with cold 

35 DMF and 0.2 M sodium bicarbonate. The product (1) was 

crystallized from ethanol. Yield: 37 g (79%) . X H NMR <d 6 - 
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DMSO) : 6 4.56 (s, 2 H, CH 2 0) , 5.24 (s, 2 H, CH 2 CO) , 5.77 <d, 
1 H, H 5 ) , 7,20 (dd, 2 H, NH 2 ) , 7.45 (m, 5 H, aromatic), 7.65 
(d, 1 H, H 6 ) . MS (FAB) m/z 260 (M+H) + (calcd 260). 

{ii) (N 4 - (Benzoyl) cytos in- 1 yl) acetic Acid (2) 
5 To a solution of (1) (10 g, 38 mmol) in 10 mL 

pyridine was added 6.6 g (47 mmol) of benzoyl chloride and 
stirred overnight at room temperature. The solution was 
evaporated under reduced pressure. The residue was 
dissolved in 1 M KOH and stirred for 3 h after which the Ph 
10 was adjusted to 2 with cone. HCl. The target compound (2) 
precipitated out. Yield: 9.3 g (90%). 1 H NMR (d 6 -DMSO) : 6 
4.59 (s, 2 H, CH 2 0) , 7.31 (d, 1 H, H 5 ) , 7.5-8.2 (7 H, 

^ aromatic, NH, H 6 ) . MS (FAB) m/z 273 (M+H) + (calcd 273). 

£ (iii) N- { (N 4 - (Benzoyl) cytosin-l-yl) acetyl) -N- (2- 

^ 15 Bocaminoethyl) glycine (3) 

.-SJ8K 

rjy 4.8 g (22 mmol) of Methyl N- (2-Boc-aminoethyl) - 

glycinate (2) , 2.4 g (14.7 mmol) of benzyloxycarbonyl 
;| chloride, 2.9 g (14.9 mmol) of DCC and 2.4 g (14.7 mmol) of 

^ DhBtOH was dissolved in 50 mL of DMF and stirred for 4 h at 

20 room temperature. Dichloromethane (100 mL) was added and 
fy the mixture extracted with 3 x 0.2 M sodium bicarbonate, 

"2 2 x 1 M sodium hydrogen sulfate and brine. The organic 

,15 phase was dried with magnesium sulfate and evaporated to 

dryness under reduced pressure. The residue was dissolved 
25 in 2 M KOH and stirred for 1 h after which the pH was 
adjusted to 2 with 1 M HCl, whereby the target compound 
precipitated. The product (3) was crystallized from 
methanol: ethyl acetate: hexane (1:2:2). Yield: 4.2 g 
(60%). X H NMR (d 6 -DMSO) : 6 1.45 and 1.47 <d, 9 H, Boc) , 
30 3.28-3,53 (m, 4 H, CH 2 ) , 4.08 and 4.31 (s, 2 H, CH 2 CO) , 4.75 
and 4.95 (s, 2 H, CH 2 CO) , 6.83 and 7.03 (m, 1 H, BocNH) , 7.38 
(m, 1 H, H 5 ) , 7.57-8.10 (m, 6 H, aromatic and H 6 ) . MS (FAB) 
m/z 474 (M+H) + (calcd 474). 
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EXAMPLE 2 

Triplex inhibition 

The effect of the benzoylated cytosine (C Bz ) 
residue on the hybridization properties of a homopyrimidine 
5 peptide nucleic acid was studied. PNA1, H - TTTTCCTCTC - Ly sNH 2 , 
was synthesized containing either C Bz in position 6 (PNA2) , 
or two C BZ residues in positions 6 and 8 (PNA3) or in 
positions 5 and 6 (PNA4) . These PNAs were hybridized to a 
complementary oligonucleotide in the parallel (ODN1) or the 

10 antiparallel (ODN2) configuration and the thermal stability 
(T m ) of the resulting complexes was determined at pH 5, 7, 
and 9 . The results are set forth in Table 1 . Absorbance 
versus temperature curves were measured at 260 nm in 100 mM 
NaCl, 10 mM sodium phosphate and 0.1 niM EDTA. Heating rate: 

15 0.5°/minute at 5-90°C. The T m s in parentheses were obtained 
by cooling from 90° to 10°C while measuring the absorbance at 
260 nm . 

TABLE 1 

Melting temperatures T TO (°C) for binding of PNA to single 
20 stranded homopurine DNA oligomer. 



Sequence 


PH 


0DN1 


ODN1 


PNA1 


5 


>85. 0 


69.5 




7 


58.5 (31.0) 


40.5 




9 


26.0 


33.5 


PNA2 


5 


56.0 (38.0) 


54.0 (42.5) 




7 


27.0 (20.0) 


32.0 (29.0) 




9 




31.0 (29.0) 


PNA3 


7 


28.0 


33.0 


PNA4 


7 


26 .0 


32.5 



Oligodeoxynucleotides : 



ODN1 = 5 ' - AAAAGGAGAG- 3 ' ; Seq. ID No : 1 
ODN2 « 5 ' -GAGAGGAAAA- 3 ' ; Seq. ID No : 2 
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Nucleic acid mimics : 

PNA1 - H-TTTTCCTCTC-LysNH 2 ; Seq. ID No : 3 

PNA2 = H - TTTTCC BZ TCTC - Ly sNH 2 ; Seq. ID No: 4, where C Bz is N 
PNA3 * H-TTTTCC B5! TC Bz TC-LysNH 2 ; Seq. ID No: 5, where C Bz is N 
5 PNA4 « H- TTTTC Bz C Bz TCTC- LysNH 2 ; Seq. ID No: 6, where C Bz is N 



Unmodified PNA1 exhibited expected behaviour. 
First, pronounced pH dependence was observed which is 
compatible with PNA 2 -DNA triplex formation requiring cytosine 
protonation. Second, the parallel complex showed highest 

10 stability at pH 5 and 7, but not at pH 9. These results 

suggest that triplexes are the most stable complexes at pH 5 
and 7, while the (antiparallel) duplex is more stable at 
pH 9. Triplex formation at pH 7 is also consistent with 
pronqunced hysteresis (« 27°C) observed at this pH. 

15 PNA2, containing one C Bz residue, apparently also 

formed a triplex at pH 5 as judged by the hysteresis, but 
the T m was lower (« 30°C) than that of the PNA1 complex. 
Thus, the benzoyl groups do indeed appear to interfere with 
efficient triplex formation. This effect is especially 

20 pronounced at pH 7. Only slight hysteresis is observed and 
notably the antiparallel complex shows higher stability 
which does not decrease at more alkaline conditions (pH 9) . 
These results strongly argue in favour of the duplex being 
the most stable complex at pH 7 with this PNA. 

25 The complexes with PNA1 and PNA2 showed equal 

thermal stability at pH 9, i. e. for the duplex, thus 
indicating that the C Bz residue does not interfere with 
Watson-Crick base pairing in the PNA-DNA duplex. This 
conclusion was supported by experiments with a C Bz containing 

3 0 mixed purine /pyrimidine sequence using the PNA oligomers 
H-AGT CAC CTA C-LysNH 2 { PNA5 ) and H-AGT CA C Bz CTA C-LysNH 2 
(PNA6) , and is set forth in Table 2. Absorbance versus 
temperature curves were measured at 26 0 ntn in 100 mM NaCl, 
10 mM sodium phosphate and 0.1 mM EDTA, at pH 7. Heating 

35 rate: 0.5%/min at 5-90°C. The T m s in parentheses were 
obtained by cooling from 90 to 10 °C while measuring the 
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absorbance at 2 SO nm. The hysteresis of the system is the 
difference between the T m (10-90°) and T m (90-10°) . 



TABLE 2 



Melting temperatures T m (°C) for binding of PNA in duplex 
5 mode to single- stranded DNA oligomer. 





PNA5 


PNA6 


0DN3 


49 (48) 


50 


ODN4 


33 (31) 


34 



Oligodeoxynucleotides ; 
10 ODN3 - 5 ' -GTAGGTCACT-3 ' ; Seq. ID No : 7 
ODN4 = 5 ' -GTAGATCACT-3 ' ; Seq. ID No : 8 
Nucleic acid mimics : 

PNA5 = H- AGTCACCTAC-LysNH 2 ; Seq. ID No : 9 

PNA6 = H-AGTCAC Bz CTAC-LysNH 2 ; Seq. ID No: 10, where C Bz is N 

15 Both of these oligomers form highly stable 

duplexes with their antiparallel oligonucleotide target. 
The stoichiometry of these complexes was determined by Job- 
plots as 1:1 complexes in both cases. The insignificant 
difference in T m s of the complexes between PNA5 and PNA6 with 

20 ODN3 falls within experimental error and can be interpreted 
as evidence of the structure shown in Figure 2 7 by which the 
benzoyl group is positioned in the major groove not 
interfering with the Watson- Crick base pairing. This is 
also in full agreement with the (G->A) mismatch positioned 

25 opposite the cytosine in the DNA strand, giving rise to a 
drop in T m of 15-16° for both PNA5 and PNA6 . An important 
feature distinguishing duplexes from triplexes under the 
experimental conditions is the very small hysteresis (less 
than 2°) obtained with duplexes when going from high to low 

3 0 temperature, whereas PNA: DNA triplexes showed pronounced 

hysteresis typically in the range of 20-30° (Table 2) . This 
is also evident for the complexes between PNA6 and ODN3 or 
ODN4 in which a hysteresis of 1-2°C was observed. The small 
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hysteresis obtained with PNA6 also indicated that the 
benzoyl group does not interfere significantly with the 
binding kinetics. 

EXAMPLE 3 

5 Coupling of nucleic acid mimic to fluorescein 

A nucleic acid mimic having a free amine moiety is 
dissolved in THF : H 2 0 to provide a solution that is 0.1 M of 
nucleic acid mimic. To the nucleic acid mimic solution is 
added fluorescein isothiocyanate, providing a solution that 
10 is 0.1-1.0 M in fluorescein isothiocyanate* The resultant 
reaction mixture is stirred for 0.1-2 hours and concentrated 
under reduced pressure . The residue is purified by 
preparative HPLC. 

EXAMPLE 4 

15 Detection of mutant /S-amyloid precursor protein gene 
expression (£APP) 

Point mutations in the gene encoding j8- amyloid 
have been implicated in familial Alzheimer's disease (FAD). 
Nucleic acid mimics are labeled with fluorescein or other 

20 fluorescent tags, as illustrated in Example 3 above. The 

fluorescent ly- labeled nucleic acid mimics are contacted with 
•a cell or tissue sample suspected of abnormal /3APP 
expression under conditions suitable for specific 
hybridization of the nucleic acid mimic to the nucleic acid 

25 encoding abnormal jSAPP. The sample is then washed to remove 
unbound nucleic acid mimics. Label remaining in the sample 
indicates bound nucleic acid and is quantitated using a 
fluorimeter, fluorescence microscope or other routine means. 

A first sample of cells or tissues suspected of 

3 0 expressing a point mutation in the £APP gene is incubated 
with a fluorescein- labeled nucleic acid mimic which is 
targeted to the mutant codon 717, codon 670 or codon 671 of 
the jSAPP mRNA. A second identical sample of cells or 
tissues is incubated with a second labeled nucleic acid 
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mimic which is targeted to the same region of normal £APP 
mRNA under conditions in which specific hybridization can 
occur. The sample is then washed to remove unbound nucleic 
acid mimic . Label remaining in the sample indicates bound 
5 nucleic acid and is quantitated using a fluorimeter or other 
routine means. The presence of mutant jSAPP is indicated if 
the first sample retains labeled nucleic acid mimic and the 
second sample does not retain labeled nucleic acid mimic. 

EXAMPLE 5 

10 Detection of mutant H-ras gene expression 

Point mutations in the H-ras gene have been 
implicated in numerous aberrations of the ras pathway. 
Nucleic acid mimics are labeled with fluorescein or other 
fluorescent tags as illustrated in Example 3 above. Labeled 

15 nucleic acid mimics are contacted with cell or tissue 
samples suspected of abnormal ras expression under 
conditions in which specific hybridization can occur. The 
sample is then washed to remove unbound labeled nucleic acid 
mimic . Label remaining in the sample indicates bound 

20 nucleic acid (i.e. that which encodes for mutant ras) and is 
quantitated using a fluorimeter, fluorescence microscope or 
other routine means. 

A first cell or tissue sample suspected of 
expressing a point mutation in the H-ras gene is incubated, 

25 under conditions suitable for specific hybridization, with a 
fluorescein- labeled nucleic acid mimic which is targeted to 
codon 12, codon 13 or codon 61 of mutant H- ras mRNA. A 
second identical sample of cells or tissues is incubated, 
under conditions suitable for specific hybridization, with a 

30 second fluorescent ly- labeled nucleic acid mimic which is 
targeted to the same region of normal H-ras mRNA. The 
samples are then washed to remove unbound labeled nucleic 
acid mimics. Label remaining in the sample indicates bound 
nucleic acid and is quantitated using a fluorimeter or other 

35 routine means* The presence of mutant H-ras is indicated if 
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the first sample exhibits fluorescence but the second sample 
does not . 

EXAMPLE 6 

Inhibition of gene expression by nucleic acid mimics 
5 A preferred assay to test the ability of nucleic 

acid mimics to inhibit expression of the E2 mRNA of 
papillomavirus is based on the well -documented 
transact ivat ion properties of E2 . Spalholtz et al . , J. 
Virol., 61, 2128 (1987). A reporter plasmid (E2RE1CAT) is 
10 constructed to contain the E2 responsive element, which 
functions as an E2 -dependent enhancer. E2RE1CAT also 
contains the SV40 early promoter, an early polyadenylation 
signal and the chloramphenicol acetyl transferase (CAT) 
gene. Within the context of this plasmid, CAT expression is 
15 dependent upon expression of E2. The dependence of CAT 

expression upon the presence of E2 is tested by transfection 
of this plasmid into C127 cells transformed by BPV-1, 
uninfected C127 cells and C127 cells cotransf ected with 
E2RE1CAT and an E2 expression vector. 

2 0 A. Inhibition of BPV-1 E2 expression: BPV-1 

transformed C127 cells are plated in 12 -well plates. Twenty 
four hours prior to transfection with E2RE1CAT, cells are 
pretreated by the addition of complementary nucleic acid 
mimic to the growth medium at a final concentrations of 5, 
25 15 and 3 0 mM. The next day, cells are transf ected with 10 

fig of E2RE1CAT by calcium phosphate precipitation. E2RE1CAT 
(10 tig) and carrier DNA (PUC 19, 10 ng) are mixed with 62 
of 2 M CaCl 2 in a final volume of 250 }iL of H 2 0, followed by 
the addition of 250 iih of 2X HBSP (1.5 mM Na 2 P0 4 , 10 mM KCl, 
30 280 mM NaCl, 12 mM glucose and 50 mM HEPES, pH 7.0) and 

incubated at room temperature for 30 minutes. This solution 
(100 iiL) is added to each test well and allowed to incubate 
for 4 hours at 37°C. After incubation, the cells are 
glycerol shocked for 1 minute at room temperature with 15% 
35 glycerol in 0.75 mM Na 2 P0 4 , 5 mM KCl, 140 mM NaCl, 6 mM 

glucose and 25 mM HEPES, pH 7.0. After shocking, the cells 



WO 97/32888 



PCIYUS97/03584 



are washed 2X with serum- free DMEM and refed with DMEM 
containing 10% fetal bovine serum and nucleic acid mimic at 
the original concentration. Forty eight hours after 
transf ection, the cells are harvested and assayed for CAT 
5 activity. 

For determination of CAT activity, cells are 
washed 2X with phosphate -buf f ered saline and collected by 
scraping. Cells are suspended in 100 /iL of 250 mM Tris-HCl, 
pH 8.0, and disrupted by freeze -thawing three times. This 
10 cell extract (25 fih) is used for each assay. 

For each assay, the following are mixed together 
in a 1.5 mL Eppendorf tube and incubated at 3 7°C for one 
, aB5 hour: 25 yCLi of cell extract, 5 of 4 mM acetyl coenzyme 

i,C A, 18 jliL of H 2 0 and 1 /xL of 14 C- chloramphenicol , 40-60 mCi/mM. 

15 After incubation, chloramphenicol (acetylated and non- 
fh acetylated forms) is extracted with ethyl acetate and 

evaporated to dryness. Samples are resuspended in 25 /xL of 
fi\ ethyl acetate, spotted onto a tic plate and chromatographed 

* in chloroform: methanol (19:1). The chromatographs are 

{^1 2 0 analyzed by autoradiography. Spots corresponding to 

f|j acetylated and non-acetylated 14 C- chloramphenicol are excised 

^ from the tic plate and counted by liquid scintillation for 

; ]g quantitation of CAT activity. Nucleic acid mimics that 

depress CAT activity in a dose -dependent manner are 
25 considered to have a positive effect. 

B. Inhibition of HPV E2 expression: The assay 
for inhibition of human papillomavirus (HPV) E2 by nucleic 
acid mimics is essentially the same as that for BPV-1 E2 . 
For HPV assays, appropriate HPVs are cotransf ected into 
30 either CV-1 or A431 cells with PSV2NE0 using the calcium 
phosphate method described above. Cells which take up DNA 
are selected for culturing in media containing the 
antibiotic G418. G418 -resistant cells are then analyzed for 
HPV DNA and RNA. Cells expressing E2 are used as target 
35 cells for complementary studies. For each nucleic acid 
mimic, cells are pretreated as above, transf ected with 
E2RE1CAT and analyzed for CAT activity as described above. 
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Nucleic acid mimics are considered to have a positive effect 
if they can depress CAT activity in a dose -dependent manner. 



WO 97/32888 



PCTYUS97/03584 



m 

: iss 



28 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Nielson, Peter E. Et al * 
(ii) TITLE OF INVENTION: Substituted Nucleic Acid Mimics 
5 (iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Woodcock Washburn et al . 

(B) STREET: One Liberty Place 46th. Floor 

(C) CITY; Philadelphia 
10 (D} STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19103 

(v) COMPUTER READABLE FORM : 

(A) MEDIUM TYPE: Floppy disk 
15 <B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 
{D) SOFTWARE: Patentln Release #1.0, Version #1.30 
(vi) CURRENT APPLICATION DATA: 

{A) APPLICATION NUMBER: N/A 
20 (B) FILING DATE: Herewith 

(C) CLASSIFICATION: 
(vii) Prior Application Data: 

(A) US Application Serial No.: 08/612,661 

(B) 08-MAR-1996 

25 (viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Caldwell, John W 

(B) REGISTRATION NUMBER: 28,937 

(C) REFERENCE /DOCKET NUMBER: IS IS -24 2 5 
(ix) TELECOMMUNICATION INFORMATION: 

30 (A) TELEPHONE: 215-568-3100 

(B) TELEFAX: 215-568-3439 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
<iii) HYPOTHETICAL; NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l; 
5 AAAAGGAGAG 10 



(2) INFORMATION FOR SEQ ID NO; 2: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE; DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GAGAGGAAAA 10 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
20 (B) TYPE; PNA 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: PNA 

(iii) HYPOTHETICAL: NO 
25 (iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTTTCCTCTC 10 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 10 base pairs 

(B) TYPE: PNA 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: PNA 

35 (iii) HYPOTHETICAL: NO 
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<iv) ANT I -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TTTTCNTCTC 10 

(2) INFORMATION FOR SEQ ID NO: 5: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 
<B) TYPE: PNA 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 
10 (ii) MOLECULE TYPE; PNA 

(iii) HYPOTHETICAL: NO 
<iv) ANTI- SENSE: YES 
[% (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

M . TTTTCNTNTC 10 

ru 

\jj 15 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
~ (A) LENGTH: 10 base pairs 

5 <B) TYPE: PNA 

(C) STRANDEDNESS: single 
%j 20 (D) TOPOLOGY: linear 

l iU (ii) MOLECULE TYPE: PNA 

" iW (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
25 TTTTNNTCTC 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
3 0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL : NO 

35 (iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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GTAGGTCACT 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GTAGATCACT 10 

(2) INFORMATION FOR SEQ ID NO : 9 : 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: PNA 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
20 (ii) MOLECULE TYPE: PNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGTCACCTAC 10 

25 (2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: PNA 

(C) STRANDEDNESS: single 
3 0 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: PNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
35 AGTCANCTAC 10 
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CLAIMS 

What is claimed is: 



1 . A nucleic acid mimic in admixture with at least one 
target molecule selected from the group consisting of nucleic 

5 acids, transcription factors, carbohydrates and proteins, said 
mimic comprising a non-naturally occurring backbone structure 
to which are appended a plurality of heterocyclic bases, 

at least one of said bases being substituted with at 
least one sterically bulky substituent at a position one, two 
10 or three atoms removed from the position of attachment of said 
base to the backbone. 

2 . The nucleic acid mimic according to claim 1 wherein 
said sterically bulky substituent is -R' , -OR', -SR' , -N(R') 2 / 
-C(R') 3 , -C(- X){R'>, -C(= X) (-Y-R')'or S(» 0) x . a <-Y-R') 

15 wherein: 

X is 0, S or NH; 

Y is O, S or NH; and 

R' comprises at least 3 atoms and is H, C^-Cgo-alky!, 
C 2 -C 50 -alkenyl, C 2 -C 50 -alkynyl, C 7 -C 50 -alkyl-aryl, C 6 -C S0 -aryl, C 10 - 

20 C 50 -naphthyl, C 12 -C 50 -biphenyl, C 7 -C 50 -aryl-alkyl , pyridyl, 
imidazolyl, pyrimidinyl, pyridazinyl, quinolyl, acridinyl, 
pyrrolyl, furanyl, thienyl, isoxazolyl, oxazolyl, thiazolyl and 
biotinyl, wherein R' can be substituted one or more times by 
-NO, -N0 2 , -S0 3 ", -CN, -OH, -NH 2 , -SH, -P0 3 2 ', -COOH, -F, -Cl, - 

25 Br and -I. 

3 . The nucleic acid mimic according to claim 1 wherein 
said base is a naturally or non-naturally occurring pyrimidine 
base . 

4 . The nucleic acid mimic according to claim 3 wherein 
30 said sterically bulky substituent is bound to C-6, C-5 or N-4 

of said naturally occurring pyrimidine base. 
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5, The nucleic acid mimic according to claim 4 wherein 
said sterically bulky substituent is bound to N-4 of said 
naturally occurring pyrimidine base. 

6 , The nucleic acid mimic according to claim 5 wherein 
5 said naturally occurring pyrimidine base is cytosine. 

7, The nucleic acid mimic according to claim 5 wherein 
said sterically bulky substituent is (C=0) -R' ' wherein R' ' is 
Ci-Cao-alkyl or C 6 -C 18 -aryl. 

8 . The nucleic acid mimic according to claim 7 wherein 

10 said sterically bulky substituent is (C=0)~C 6 H 5 . 

9 . A method for the determination of a nucleic acid 

comprising : 

providing a nucleic acid mimic ; 

incubating said nucleic acid mimic and said nucleic 
15 acid under conditions suitable for the formation of a duplex 
between said nucleic acid mimic and said nucleic acid; and 

determining the occurrence of said duplex as a 
measure of the occurrence of said nucleic acid; 

said nucleic acid mimic comprising a non-naturally 
20 occurring backbone structure to which are appended a plurality 
of heterocyclic bases , 

at least one of said bases being substituted with at 
leaoftL'one sterically bulky substituent at a position one, two 
or fc&ree atoms removed from the position of attachment of said 
25 base to the backbone. 
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10. A compound for the preparation of a nucleic acid 

mimic having the general formula: 

R3 

■j. 

R 1^%2 

wherein : 

5 R 1 is C 1 -C 4 -alkyl having at least one -COOP 1 , -NHP 1 , -OP 1 or -SP 1 
group; P 1 is hydrogen or a protecting group; 

R 2 is C x -C 4 alkyl substituted by -COOP 2 , -NHP 2 , -OP 2 or -SP 2 , 
wherein P 2 is hydrogen or a protecting group; 

M is a naturally or non-naturally occurring heterocyclic moiety 
10 bound to N by a one to three carbon linker; and 

R 3 is a sterically bulky substituent containing 3 or more non- 
hydrogen atoms . 



11. The nucleic acid mimic according to claim 1 having formula 
(I) : 

15 L 1 L 2 L n 

i- ' i» }■ 

(I) 

wherein : 

n is at least 2, 

each of L 1 -!/ 1 is independently selected from the 
20 group consisting of hydrogen, hydroxy, (C 1 -C 4 ) alkanoyl , 
naturally occurring nucleobases, non-naturally occurring 
nucleobases; aromatic moieties, DNA intercalators , nucleobase- 
binding groups, heterocyclic moieties, and reporter ligands, 
at least one of L 1 -!/ 1 being said base substituted with at least 
25 one sterically bulky substituent; 

each of C 1 -^ is (CR 6 R 7 ) y where R 6 is hydrogen and R 7 
is selected from the group consisting of the side chains of 
naturally occurring alpha amino acids, or R 6 and R 7 are 
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independently selected from the group consisting of hydrogen, 
(C 2 -C 6 ) alkyl, aryl, aralkyl, heteroaryl, hydroxy, (C^-CJ alkoxy, 
(C 1 -C 6 ) alkylthio, NR 3 R 4 and SR 5 , where R 3 and R 4 are as defined 
above, and R 5 is hydrogen, (C x -C 6 ) alkyl , hydroxy-, alkoxy-, or 
alkylthio- substituted (Cj-Cg) alkyl, or R 6 and R 7 taken together 
complete an alicyclic or heterocyclic system; 

each of D x -D n is (CR 6 R 7 ) Z where R 6 and R 7 are as 
defined above; 

each of y and z is zero or an integer from 1 to 10, 
the sum y + z being greater than 2 but not more than 10; 

each of G^G* 1 ' 1 is -NR 3 CO-, -NR 3 CS-, -NR 3 SO- or -NR 3 S0 2 - 
, in either orientation, where R 3 is as defined above; 

each pair of A x -A n and B 1 -B n are selected such that: 

(a) A is a group of formula (Ila) , (lib) or (lie) 
and B is N or R 3 N + ; or 

(b) A is a group of formula (lid) and B is CH; 



(Ila) 



R 

(lib) 



20 



where : 



25 



R * 

(He) 



R 

(lid) 



X is O, S, Se, NR 3 , CH 2 or C(CH 3 ) : 



Y is a single bond, 0, S or NR 4 ; 

each of p and q is zero or an integer from 1 to 5 ; 
each of r and s is zero or an integer from 1 to 5; 
each R 1 and R 2 is independently selected from the 
group consisting of hydrogen, (C 1 ~C A ) alkyl which may be 
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hydroxy- or alkoxy- or alkylthio-substituted, hydroxy, alkoxy, 
alkylthio, amino and halogen; 

each of G 1 -G n - ± is -NR 3 CO-, -NR 3 CS-, -NR 3 SO- or -NR 3 S0 2 - 
, in either orientation, where R 3 is as defined above; 
5 Q is -C0 2 H, -CONR'R", -S0 3 H or -S0 2 NR'R'' or an 

activated derivative of -C0 2 H or -S0 3 H; and 

I is -NHR /,, R' / '' or -NR' ' ' C (O) R' ' ' ' , where R' , R", 
R' ' ' and R' ' ' ' are independently selected from the group 
consisting of hydrogen, alkyl, amino protecting groups, 
10 reporter ligands, intercalators, chelators, peptides, proteins, 
carbohydrates, lipids, steroids, oligonucleotides and soluble 
and non- soluble polymers. 

12. The nucleic acid mimic according to claim 11 
wherein said target molecule is a nucleic acid. 

15 13 . The nucleic acid mimic according to claim 11 

wherein said sterically bulky substituent is -R' , -OR' , -SR' , 
-N(R') 2 , -C(R') 3 , -C(=X)(R'), -C(« X) (-Y-R') or S ( = O) x _ 2 ( -Y- 
R' ) wherein : 

X is O, S or NH; 

20 Y is O, S or NH; and 

R' comprises at least 3 atoms and is H, C L -C 50 -alkyl # 
C 2 -C s0 -alkenyl, C 2 -C 50 -alkynyl, C 7 -C s0 -alkyl-aryl , C 6 -C 50 -aryl, C 10 - 
C 50 -napht hyl , C 12 - C 50 -bipheny 1 , C 7 - C so - ary 1 - alkyl , pyr idy 1 , 
imidazolyl, pyrimidinyl, pyridazinyl, quinolyl, acridinyl, 

25 pyrrolyl, furanyl, thienyl, isoxazolyl, oxazolyl, thiazolyl and 
biotinyl, wherein R' can be substituted one or more times by 
-NO, -N0 2 , -SO3-, -CN, -OH, -NH 2 , -SH, -P0 3 2 ", -COOH, -F, -CI, - 
Br and - I . 

14. The nucleic acid mimic according to claim 11 
3 0 wherein said base is a naturally or non-naturally occurring 
pyrimidine base. 
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15. The nucleic acid mimic according to claim 14 
wherein said sterically bulky substituent is bound to C-6, C-5 
or N-4 of said naturally occurring pyrimidine base. 

16. The nucleic acid mimic according to claim 15 
5 wherein said sterically bulky substituent is bound to N-4 of 

said naturally occurring pyrimidine base. 

17. The nucleic acid mimic according to claim 16 
wherein said naturally occurring pyrimidine base is cytosine. 

18. The nucleic acid mimic according to claim 16 
10 wherein said sterically bulky substituent is (C=0) -R' ' wherein 

R' ' is Ci-Cso-alkyl or C 6 ~C 18 -aryl. 

19. The nucleic acid mimic according to claim 18 
wherein said sterically bulky substituent is (C=0)-C 6 H 5 . 

20. The nucleic acid mimic according to claim 11 
15 having formula (Ilia) : 




(Ilia) 

wherein : 

each L is independently selected from the group 
20 consisting of hydrogen, phenyl, heterocyclic base moieties, 
including those substituted with a sterically bulky group or 
groups, naturally occurring nucleobases, and non-naturally 
occurring nucleobases, at least one L being said base 
substituted with at least one sterically bulky substituent; 
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each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 

n is an integer from 1 to 60; 
5 each of k, 1, and m is independently zero or an 

integer from 1 to 5; 

p is zero or 1; 

R h is OH, NH 2 or -NHLysNH 2 ; and 
R 1 is H or COCH 3 . 

10 21. The nucleic acid mimic according to claim 11 

having formula (Illb) : 




(Illb) 



wherein: 

15 each L is independently selected from the group 

consisting of hydrogen, phenyl, heterocyclic base moieties, 
including those substituted with a sterically bulky group or 
groups, naturally occurring nucleobases, and non-naturally 
occurring nucleobases, at least one L being said base 
20 substituted with at least one sterically bulky substituent; 

each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids ; 

n is an integer from 1 to 60; 
25 each of k, 1, and m is independently zero or an 

integer from 1 to 5; 

p is zero or 1; 

R h is OH, NH 2 or -NHLysNH 2 ; and 
R 1 is H or COCH3. 
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As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as 
stated below next to my name; and 

I verily believe that I am the original, first and sole 
inventor (if only one name is listed below) or an original, first 
and joint inventor (if plural names are listed below) of the 
subject matter which is claimed and for which a patent is sought on 
the invention entitled: SUBSTITUTED NUCLEIC ACID MIMICS the 
specification of which: 

( ) is attached hereto. 

(XX) was filed on March 7, 1997 as International 
Application Serial No. PCT/US97 / 03584 and was 
amended on October 8, 1997 . 

I hereby state that I have reviewed and understand the 
contents of the above identified specification, including the 
claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose to the U.S.^ Patent and 
Trademark Office all information known to be material to the 
patentability of this application in accordance with 37 CFR § 1.56. 

I hereby claim foreign priority benefits under 35 U.S.C. 
§ 119 of any foreign application (s) for patent or inventor's 
certificate listed below and have also identified below any foreign 
application for patent or inventor's certificate having a filing 
date before that of any application on which priority is claimed: 

Country Number Date Filed Priority Claimed 
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I hereby claim the benefit under 35 U.S.C. § 120 of any- 
United States application (s) listed below and, insofar as the 
subject matter of 'each of the claims of this application is not 
disclosed in the prior United States application in the manner 
provided by the first paragraph of 35 U.S.C. § 112, I acknowledge 
the duty to disclose to the U.S. Patent and Trademark Office all 
information known to be material to patentability as defined in 37 
CFR § 1,56 which became available between the filing date of the 
prior application and the national or PCT international filing date 
of this application: 

Application Serial No* Filing Date Status (patented, 

pending) 



08/612,661 March 8, 1996 Pending 



I hereby appoint the following attorney (s) and/or 
agent (s) to prosecute this application and to transact all business 
in the Patent and Trademark Office connected therewith: John W. 
Caldwell and Joseph Lucci, Registration Nos. 2&*-SL3-7~~and 3,3/307 of 
the firm of WOODCOCK WASHBURN KURTZ MACKIEWICZ & NORRIS LLP, One 
Liberty Place - 46th Floor, Philadelphia, Pennsylvania 19103, and 
Herb Boswell and Laurel Bernstein, Registration Nos. 22^311 and 
ZT^l&Q, of ISIS Pharmaceuticals , Inc, 2292 Faraday Avenue, 
Carlsbad, California 92008. 



Address all telephone calls and correspondence to: 
Joseph JLuq c i 

WOODCOCK WASHBURN KURTZ MACKIEWICZ & NORRIS LLP 

Ghe^D^^ — 

PKtTid^^ 

Telephone No. 215-568-3100. 



I hereby declare that all statements made herein of my 
own knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false 
statements and the like so made are punishable " by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the 
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United States. Code and that such willful false statements may 
jeopardize- the validity of the application or any patent issued 
thereon . 
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